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Modtaget 

TITLE: Methods for rolling circle amplification and signal trapping of libraries 
FIELD OF INVENTION 

A method for isolating genes encoding secreted polypeptides from rolling circle amplified
gene libraries is described, in which the endogenous secretion signal sequences are
detected using an in vitro transposition reaction, where the transposon comprises a
secretion reporter.

BACKGROUND OF THE INVENTION 

Secreted polypeptides are highly interesting for the biotechnological and pharmaceutical
industries, since they can be produced recombinantly with a minimum of purification steps
necessary. A positive screening system which selects only clones encoding secreted
polypeptides is thus very desirable. Signal trapping is used to identify genes encoding
polypeptides that comprise a signal peptide by applying a translational fusion to an
extracellular reporter encoding gene lacking its own signal. Methods and protocols for using
signal trapping were disclosed in WO 01/77315 (Novozymes A/S).

Cloning and gene library construction using rolling circle amplification, mainly for sequencing
purposes of in particular non-clonable targets, was disclosed by P.M. Lizardi (US 6,287,824;
US 6,280,949), and by Dean et al. (2001, Rapid amplification of plasmid and phage DNA
using Phi29 DNA polymerase and multiply-primed rolling circle amplification. Genome
Research 11:1095-1099). Commercial kits are available comprising Phi29 DNA polymerase,
for use in rolling circle amplification sequencing, e.g. TempliPhi™ DNA Sequencing
Template Amplification Kit (Amersham Biosciences, USA).

The construction of gene libraries for cloning or screening traditionally employs intermediate
amplification host cells, e.g. Escherichia coli, because the transformation efficiency is very low
with raw ligations, especially into eukaryotic host cells. The use of intermediate host cells for
amplifying and/or maintaining a gene library increases the likelihood of the library not being
representative of the organism it was prepared from. There may well be a bias against
genetic material, the presence of which could be inhibitory or even lethal to the intermediate
host, e.g. genes encoding anti-microbial activities may be lost from the library.



SUMMARY OF THE INVENTION: 

The problem to be solved by the present invention is how to identify those clones in a gene
library that encode efficiently secreted or surface-displayed polypeptides, even polypeptides
[...] to amplify the library in an in vitro [...] traditional and time-consuming assays. [...]
secreted or surface-displayed polypeptides from new organisms.

[...] rolling circle amplification procedure [...] efficiency [...]
thereby enabling the efficient identification and isolation of a polynucleotide encoding a
polypeptide of interest which comprises a signal sequence.

Accordingly, in a first aspect the invention relates to a method for isolating a polynucleotide
that encodes a polypeptide of interest which comprises a signal sequence for secretion or
partial secretion, the method comprising the sequential steps of:

a) providing a DNA or cDNA library from an organism producing the polypeptide of
interest, wherein the library is comprised in a circular vector and is produced in vitro
without ultraviolet irradiation of the component polynucleotides;

b) amplifying the library by rolling circle amplification, thereby forming concatamers;

c) inserting into the library a DNA fragment comprising a promoterless and secretion
signal-less polynucleotide encoding a secretion reporter;

d) introducing the amplified library comprising the inserted DNA fragment into a host cell;

e) screening for and selecting a host cell that secretes or partially secretes the active
secretion reporter; and

f) identifying from the selected host cell the polynucleotide into which the secretion
reporter was inserted, and isolating the polynucleotide;

wherein steps b) and c) may be performed in any order.

The terms "secretion", "partial secretion", or "membrane displayed" are used
interchangeably herein and mean translocation of a part of a polypeptide or of a whole
polypeptide across a membrane of a cell such as a prokaryotic, eukaryotic or archaeal cell.
As a non-limiting example of polypeptide secretion, a membrane-bound [...]
protein such as a receptor may in the method of the invention be expressed in a host cell as a
fusion polypeptide that is fused [...]

"Secretion" in this context means translocation of the fusion polypeptide across a membrane
of the host cell to such an extent that at least the secretion reporter part of the fusion
polypeptide is displayed on the extracellular side of the membrane and is functionally active
in a secretion reporter assay. In other examples the fusion polypeptide may be completely
secreted into the cultivation media without any residual linkage to the secreting cell.

In a non-limiting example herein, cDNA or genomic DNA libraries are tagged with a
transposon containing a reporter gene. All in-frame fusions of the transposon reporter gene
with a gene in the library containing a signal sequence are detected by assaying the
expression of active reporter. The upstream and downstream flanking DNA sequences of the
transposon insertion are then sequenced and the gene into which the transposon was
inserted is identified by sequence analysis. In many cases, obtaining the full sequence of a
tagged gene will be facilitated by the recovery of numerous clones of the same gene tagged
in different nucleotide positions or sites. Positive clones are sequenced to identify clones that
represent the same gene but have different transposon insertion sites. In this way all or most
of the open reading frame (ORF) can be obtained by contig assembly. If a complete ORF
cannot be obtained in this manner, perhaps due to an insufficient number or an uneven
distribution of transposon inserts in the gene, then the full length gene may be obtained by
classical primer walking DNA sequencing.
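
As an illustration only, the contig-assembly reasoning above can be sketched in Python. The sketch assumes, hypothetically, that each transposon insertion yields one upstream and one downstream flanking read of a fixed length, and that positions are 0-based coordinates on the ORF. The function names, the read length and the coordinates are assumptions made for this example and are not taken from the described method.

```python
def merge_intervals(intervals):
    """Merge overlapping or touching half-open intervals (start, end)."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def reads_from_insertions(insertion_sites, read_length):
    """Assume each insertion gives one upstream and one downstream flanking read."""
    reads = []
    for site in insertion_sites:
        reads.append((site - read_length, site))  # upstream flank
        reads.append((site, site + read_length))  # downstream flank
    return reads


def uncovered_regions(orf_length, intervals):
    """Return the parts of [0, orf_length) that no read covers."""
    gaps = []
    position = 0
    for start, end in merge_intervals(intervals):
        start, end = max(start, 0), min(end, orf_length)
        if end <= start:
            continue  # interval lies entirely outside the ORF
        if start > position:
            gaps.append((position, start))
        position = end
    if position < orf_length:
        gaps.append((position, orf_length))
    return gaps
```

Under these assumptions, an empty list from `uncovered_regions` would suggest that the recovered insertions span the whole ORF, whereas a non-empty list marks the regions that would still have to be closed, for instance by primer walking.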

The sequence information thus obtained can then be used to isolate the complete gene of
interest including the sequence encoding the secretion signal sequence and further to make
an optimal expression construct for industrial production of the secreted proteins, all well
within the skill of the art, whereafter the industrial production process of expressing and
recovering the enzyme is a matter thoroughly described in the art as shown elsewhere
herein.

In a second aspect the invention relates to a polynucleotide encoding a polypeptide of
interest, wherein said polynucleotide is isolated by the method of the present invention. A
third aspect of the invention relates to a polypeptide of interest which is encoded by a
polynucleotide as defined in the second aspect. A fourth aspect relates to an expression
system comprising a polynucleotide as defined in the second aspect. In a fifth aspect the
invention relates to a host cell comprising at least one copy of a polynucleotide as defined in
the second aspect, or an expression system as defined in the fourth aspect. A final aspect
relates to a process for producing a polypeptide of interest, comprising cultivating a host cell
as defined in the previous aspect under conditions suitable for expressing the polynucleotide
as defined in the second aspect, wherein said host cell secretes the polypeptide encoded by
said polynucleotide into the growth medium.

[...] and Tjian, Nature 316: 774-78, 1985).

[...] refers to [...]
double-stranded DNA found, inter alia, in linear or circular DNA molecules (e.g., restriction
fragments), plasmids, and chromosomes. In discussing the structure of particular
double-stranded DNA molecules, sequences may be described herein according to the normal
convention of giving only the sequence in the 5' to 3' direction along the non-transcribed
strand of DNA (i.e., the strand having a sequence homologous to the mRNA). A
"recombinant DNA molecule" is a DNA molecule that has undergone a molecular biological
manipulation.

DETAILED DESCRIPTION OF THE INVENTION 

The present invention allows the screening of genebanks or -libraries by proxy, for genes
encoding secreted polypeptides or enzymes even of unknown activity and thus without
known screening assays, including polypeptides having antimicrobial or biocidal activity. The
methods of the invention enable screening for polypeptides of potential industrial interest
that would not likely have been isolated using conventional screening assays.

A method for isolating a polynucleotide that encodes a polypeptide of interest which
comprises a signal sequence for secretion or partial secretion, the method comprising the
sequential steps of:

a) providing a DNA or cDNA library from an organism producing the polypeptide of
interest, wherein the library is comprised in a circular vector and is produced in vitro
without ultraviolet irradiation of the component polynucleotides;

b) amplifying the library by rolling circle amplification, thereby forming concatamers;

c) inserting into the library a DNA fragment comprising a promoterless and secretion
signal-less polynucleotide encoding a secretion reporter;

d) introducing the amplified library comprising the inserted DNA fragment into a host cell;

e) screening for and selecting a host cell that secretes or partially secretes the active
secretion reporter; and

f) identifying from the selected host cell the polynucleotide into which the secretion
reporter was inserted, and isolating the polynucleotide;

wherein steps b) and c) may be performed in any order.
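
Because the reporter of step c) is promoterless and lacks its own secretion signal, a host cell can only be selected in step e) if the insertion creates a translational fusion with a gene that supplies both. The following Python sketch illustrates that condition under simplifying assumptions that are not stated in the source: insertion positions are counted as the number of ORF nucleotides upstream of the insertion, the reporter coding sequence starts a fixed number of linker nucleotides after the inserted fragment's end, and the insertion must lie downstream of the signal-peptide coding region. All names and numbers are hypothetical.

```python
def is_productive_fusion(orf_nt_upstream, linker_nt, same_orientation, signal_nt=0):
    """Return True if an insertion would give an in-frame signal-reporter fusion.

    orf_nt_upstream: ORF nucleotides lying upstream of the insertion point
    linker_nt: nucleotides between the fragment end and the reporter's first codon
    same_orientation: reporter reads in the same direction as the tagged gene
    signal_nt: length of the signal-peptide coding region (assumed known here)
    """
    if not same_orientation:
        return False  # reporter would not be translated from the host gene
    if orf_nt_upstream < signal_nt:
        return False  # signal sequence would be interrupted
    return (orf_nt_upstream + linker_nt) % 3 == 0


def count_productive_sites(orf_length, linker_nt, signal_nt):
    """Count productive (position, orientation) pairs within one ORF."""
    return sum(
        is_productive_fusion(position, linker_nt, orientation, signal_nt)
        for position in range(orf_length)
        for orientation in (True, False)
    )
```

In this simplified model only one insertion in six within the eligible region is productive (one of two orientations and one of three reading frames), which is consistent with the source's remark that recovering numerous independent insertions in the same gene is helpful.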

The present invention can be performed using any gene libraries known in the art;
specifically, it can also be used with gene libraries of viable but non-culturable organisms as
typically seen in environmental samples. Processes of producing representative or
normalized gene-libraries from environmental samples containing non-culturable organisms
have been described in the art (US 5,763,239). Accordingly a preferred embodiment of the
present invention relates to a method of the first aspect, wherein the DNA or the cDNA
library is normalized.

A preferred embodiment relates to a method of the first aspect, wherein the genomic DNA library
or cDNA library is derived from a microorganism; preferably the microorganism is a fungus, a
filamentous fungus or a yeast; more preferably the microorganism is a bacterium, or the
[...]

Another preferred embodiment relates to a method of the first aspect, wherein the genomic
DNA library or cDNA library is derived from a multicellular organism, preferably from a
mammalian cell, most preferably from a human cell.

Vectors

The present invention also relates to recombinant vectors comprising a nucleic acid
sequence of the present invention, a promoter, and transcriptional and translational stop
signals. The various nucleic acid and control sequences described above may be joined
together to produce a recombinant expression vector which may include one or more
convenient restriction sites to allow for insertion or substitution of the nucleic acid sequence
encoding the polypeptide at such sites. Alternatively, the nucleic acid sequence of the
present invention may be expressed by inserting the nucleic acid sequence or a nucleic acid
construct comprising the sequence into an appropriate vector for expression. In creating the
expression vector, the coding sequence is located in the vector so that the coding sequence
is operably linked with the appropriate control sequences for expression.

The recombinant expression vector may be any vector (e.g., a plasmid, phagemid, cosmid,
or virus) which can be conveniently subjected to recombinant DNA procedures and can bring
about the expression of the nucleic acid sequence. The choice of the vector will typically
depend on the compatibility of the vector with the host cell into which the vector is to be
introduced. The vectors may be linear or closed circular plasmids.

The vector may be an autonomously replicating vector, i.e., a vector which exists as an
extrachromosomal entity, the replication of which is independent of chromosomal replication,
e.g., a plasmid, an extrachromosomal element, a minichromosome, or an artificial
chromosome. The vector may contain any means for assuring self-replication. Alternatively,
the vector may be one which, when introduced into the host cell, is integrated into the
genome and replicated together with the chromosome(s) into which it has been integrated.
Furthermore, a single vector or plasmid or two or more vectors or plasmids which together
contain the total DNA to be introduced into the genome of the host cell, or a transposon may
be used.



The vectors of the present invention preferably contain one or more selectable markers
[...]. A selectable marker is a gene the product of which [...] (ornithine
carbamoyltransferase), [...] (orotidine-5'-phosphate decarboxylase), sC (sulfate
adenyltransferase), trpC (anthranilate synthase), as well as equivalents [...]. [...] for use in
an Aspergillus cell are the [...] Streptomyces hygroscopicus.

The vectors of the present invention preferably contain an element(s) that permits
integration of the vector into the host cell's genome or autonomous replication of the vector
in the cell independent of the genome.

For integration into the host cell genome, the vector may rely on the nucleic acid sequence
encoding the polypeptide or any other element of the vector for stable integration of the
vector into the genome by homologous or nonhomologous recombination. Alternatively, [...]
the genome of the host cell. The additional [...] 100 to [...]500 base pairs, preferably 400 to
1,600 base pairs, and most preferably 800 to [...] homologous recombination. The
integrational elements may be any sequence that is homologous with the target sequence in
the genome of the host cell. Furthermore, the integrational elements may be non-encoding or
encoding nucleic acid sequences. On the other hand, the vector may be integrated into the
genome of the host cell by non-homologous recombination.

For autonomous replication, the vector may further comprise an origin of replication enabling
the vector to replicate autonomously in the host cell in question. Examples of bacterial
origins of replication are the origins of replication of plasmids pBR322, pUC19, pACYC177
and pACYC184 permitting replication in E. coli, and pUB110, pE194, pTA1060, and pAMβ1
permitting replication in Bacillus. Examples of origins of replication [...]
and the combination of ARS4 and CEN6. The origin of replication may be one having a
mutation which makes its functioning temperature-sensitive in the host cell (see, e.g.,
Ehrlich, 1978, Proceedings of the National Academy of Sciences USA 75: 1433).

More than one copy of a nucleic acid sequence of the present invention may be inserted into
the host cell to increase production of the gene product. An increase in the copy number of
the nucleic acid sequence can be obtained by integrating at least one additional copy of the
sequence into the host cell genome or by including an amplifiable selectable marker gene
with the nucleic acid sequence where cells containing amplified copies of the selectable
marker gene, and thereby additional copies of the nucleic acid sequence, can be selected
for by cultivating the cells in the presence of the appropriate selectable agent.

The procedures used to ligate the elements described above to construct the recombinant
expression vectors of the present invention are well known to one skilled in the art (see, e.g.,
Sambrook et al., 1989, supra).

Since the rolling circle amplification of the invention generates concatamers as an
end-product, it may be advantageous to convert the concatamers to monomers. This
monomerization may be achieved in any manner which is convenient, e.g. by introducing at
least one restriction enzyme cleavage site in the vector for subsequent restriction and
circularization; or by introducing at least one cos site to enable cleavage during phage
packaging; or by introducing at least one recombination recognition site, such as the Flp
recombinase recognition site, for subsequent looping out of monomers using the Flp
enzyme. Accordingly, in a preferred embodiment the invention relates to a method of the first
aspect, wherein the vector comprises at least one restriction enzyme cleavage site and/or at
least one cos site and/or at least one recombination recognition site.
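
The restriction-based route to monomers can be illustrated with a small string model in Python. This is a sketch under stated assumptions rather than a protocol: the concatamer is modelled as exact tandem repeats of a made-up 16-nucleotide "vector" that carries a single EcoRI site (GAATTC, cut after the G), and only one strand is represented. The function names and the sequence are hypothetical.

```python
def rolling_circle_concatamer(monomer, copies):
    """Model a linear concatamer as tandem repeats of the circular monomer."""
    return monomer * copies


def digest(linear_dna, site, cut_offset):
    """Cut a linear molecule cut_offset bases into every occurrence of site."""
    fragments = []
    start = 0
    index = linear_dna.find(site)
    while index != -1:
        cut = index + cut_offset
        fragments.append(linear_dna[start:cut])
        start = cut
        index = linear_dna.find(site, index + 1)
    fragments.append(linear_dna[start:])
    return [fragment for fragment in fragments if fragment]


# Hypothetical 16-nt vector with exactly one EcoRI site per unit.
monomer = "GAATTC" + "ACGTACGTAC"
concatamer = rolling_circle_concatamer(monomer, 4)
fragments = digest(concatamer, "GAATTC", 1)

# Fragments between two neighbouring sites are exactly one unit long.
unit_length = [f for f in fragments if len(f) == len(monomer)]
```

In this model every fragment bounded by two neighbouring sites has the length of one vector unit, so a single cleavage site per vector is enough to release linear monomers that could then be circularized; the two end fragments of the concatamer are shorter than a unit.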

Another preferred embodiment of the invention relates to a method of the first aspect,
wherein the amplified library concatamers are converted to monomers before performing
step d); preferably the vector comprises at least one restriction enzyme recognition site, and
[...] recognition site, and the concatamers are converted to monomers by
homologous recombination mediated by the recognition sites and a specific recombinase.

It is well known in the art that supercoiled circular polynucleotides have a higher
transformation efficiency than their relaxed counterparts. Accordingly, a preferred
embodiment of the invention relates to a method of the first aspect, wherein the monomers are
circularized and then treated with a DNA topoisomerase.

Yet another preferred embodiment of the invention relates to a method of the first aspect,
wherein the vector comprises at least one cos site and wherein subsequent to step b)
the amplified library concatamers are converted to monomers prior to step d) by cos site
cleavage during phage-packaging.

In some instances, better transformation efficiency is achieved when concatamers are used
rather than monomers, e.g. for many Bacillus cells this is the case. Accordingly, one
preferred embodiment of the invention relates to a method of the first aspect, wherein the
library is introduced into the host cell as concatamers.

A DNA fragment comprising a promoterless and secretion signal-less polynucleotide
encoding a secretion reporter is inserted into the library in step c) of the method of the first
aspect [...]. So a preferred embodiment [...]
step c) is performed in vitro. A further preferred embodiment of the invention relates to a
method of the first aspect, wherein the DNA fragment comprises a transposon, preferably
a MuA transposon. Still another preferred embodiment of the invention relates to a method
of the first aspect, wherein the DNA fragment comprises an origin of replication which is
functional in the host cell, preferably the origin of replication is functional in Escherichia coli,
most preferably the origin of replication is colE1.

Another preferred embodiment of the invention relates to a method of the first aspect,
wherein the secretion reporter is a protein which, when secreted from the host cell, allows
said cell to grow in the presence of a substance which otherwise inhibits growth of said cell,
preferably the secretion reporter is a β-lactamase or an invertase.

The introduction of a vector into a bacterial host cell may, for instance, be [...]
protoplast transformation (see, e.g., Chang and Cohen, 1979, Molecular General Genetics
168: 111-115), using competent cells (see, e.g., Young and Spizizen, 1961, Journal of
Bacteriology [...], or Dubnau and Davidoff-Abelson, 1971, Journal of [...]),
electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques
6: [...]), or conjugation (see, e.g., Koehler and Thorne, 1987, Journal of Bacteriology
169: 5771-5278).



10292. OOO-DK 






The host cell may be a eukaryote, such as a mammalian, insect, plant, or fungal cell. In a
preferred embodiment, the host cell is a fungal cell. "Fungi" as used herein includes the
phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota (as defined by
Hawksworth et al., In, Ainsworth and Bisby's Dictionary of The Fungi, 8th edition, 1995, CAB
International, University Press, Cambridge, UK) as well as the Oomycota (as cited in
Hawksworth et al., 1995, supra, page 171) and all mitosporic fungi (Hawksworth et al., 1995,
supra).

In a more preferred embodiment, the fungal host cell is a yeast cell. "Yeast" as used herein
includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and yeast
belonging to the Fungi Imperfecti (Blastomycetes). Since the classification of yeast may
change in the future, for the purposes of this invention, yeast shall be defined as described
in Biology and Activities of Yeast (Skinner, F.A., Passmore, S.M., and Davenport, R.R., eds,
Soc. App. Bacteriol. Symposium Series No. 9, 1980).

In an even more preferred embodiment, the yeast host cell is a Candida, Hansenula,
Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell. In a most
preferred embodiment, the yeast host cell is a Saccharomyces carlsbergensis,
Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii,
Saccharomyces kluyveri, Saccharomyces norbensis or Saccharomyces oviformis cell. In
another most preferred embodiment, the yeast host cell is a Kluyveromyces lactis cell. In
another most preferred embodiment, the yeast host cell is a Yarrowia lipolytica cell.

In another more preferred embodiment, the fungal host cell is a filamentous fungal cell.
"Filamentous fungi" include all filamentous forms of the subdivision Eumycota and Oomycota
(as defined by Hawksworth et al., 1995, supra). The filamentous fungi are characterized by
a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other complex
polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism is
obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces
cerevisiae is by budding of a unicellular thallus and carbon catabolism may be fermentative.
In an even more preferred embodiment, the filamentous fungal host cell is a cell of a species
of, but not limited to, Acremonium, Aspergillus, Fusarium, Humicola, Mucor, Myceliophthora,
Neurospora, Penicillium, Thielavia, Tolypocladium, or Trichoderma.

In a most preferred embodiment, the filamentous fungal host cell is an Aspergillus awamori,
Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger or
[...]

[...] is a Fusarium bactridioides, Fusarium cerealis, Fusarium cro[...]

[...] Hinnen et al., 1978, Proceedings of the National Academy of Sciences USA 75: [...]

[...] Escherichia [...] amyloliquefaciens, [...] Bacillus thuringiensis.

[...]






[...] Myceliophthora, Neurospora, Paecilomyces, Penicillium, Piromyces,
Schizophyllum, Talaromyces, Thermoascus, Tolypocladium, or Trichoderma; more
preferably the fungal host cell is of the species Saccharomyces cerevisiae, Aspergillus
aculeatus, Aspergillus awamori, Aspergillus nidulans, Aspergillus niger, or Aspergillus
oryzae.

In yet another preferred embodiment the invention relates to a method of the first aspect,
wherein the host cell is mammalian.

When a host cell has been selected according to the method of the invention, it is of interest
to identify the polynucleotide into which the secretion reporter is inserted. There are many
ways in the art that allow such an identification, e.g. hybridizations, probing, subcloning etc.
An efficient way is to perform a polynucleotide sequencing of the DNA flanking the inserted
DNA fragment, the sequence of which is already known. So, a preferred embodiment of the
invention relates to a method of the first aspect, wherein the identifying of the polynucleotide
in step f) is done by DNA sequencing using at least one primer directed to the DNA fragment
of step c), or using at least one primer directed to the vector of step a); preferably wherein
isolating the polynucleotide in step f) is done by utilizing the DNA sequence information
obtained.
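
Since the sequence of the inserted DNA fragment is known, the flanking gene sequence can be picked out of a sequencing read by locating the fragment's end. The Python sketch below shows one simple way this could be done; it is an assumption-laden illustration, not the procedure of the invention. The end sequence `GGCA`, the reads and the function names are invented for the example, and exact string matching stands in for the error-tolerant alignment a real analysis would need.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of an uppercase DNA string."""
    return sequence.translate(COMPLEMENT)[::-1]


def flanking_sequence(read, fragment_end):
    """Return the bases that follow fragment_end in read, or None if absent."""
    index = read.find(fragment_end)
    if index == -1:
        return None
    return read[index + len(fragment_end):]


def flank_either_strand(read, fragment_end):
    """Look for the fragment end on the read as given, then on its reverse complement."""
    flank = flanking_sequence(read, fragment_end)
    if flank is None:
        flank = flanking_sequence(reverse_complement(read), fragment_end)
    return flank
```

The returned flank is the part of the read that belongs to the tagged gene and could then be compared with other clones or with sequence databases.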

Once sequence information has been obtained on the polynucleotide encoding the
polypeptide of interest, then the complete encoding gene may be isolated, either from the
genome of the originating organism, or from the previously established library. Accordingly, a
preferred embodiment of the invention relates to a method of the first aspect, wherein the
polynucleotide in step f) is isolated from the genome of the organism producing the
polypeptide of interest, or from a DNA or cDNA library of the organism.

A great number of secreted or partially secreted polypeptides are of commercial or industrial
interest. A preferred embodiment of the invention relates to a method of the first aspect,
wherein the polypeptide of interest is an enzyme that is secreted from the host cell.

Another preferred embodiment of the invention relates to a method of the first aspect,
wherein the polypeptide of interest is a membrane-bound receptor, preferably a
two-component signal (TCS) transduction receptor, and more preferably a cytokine receptor. Yet
another preferred embodiment of the invention relates to a method of the first aspect
wherein the polypeptide of interest is a secreted cytokine. Still another preferred
embodiment of the invention relates to a method of the first aspect wherein the polypeptide
of interest is a polypeptide which elicits an immunogenic response in humans. A preferred
[...]
of interest has antimicrobial activity, or wherein the polypeptide of interest is a plant
pathogenic polypeptide.

The industrial route to producing the polypeptide of interest [...]
relates to a method of the first aspect, wherein [...]
constructing an expression system is performed, said expression system comprising the
polynucleotide isolated in step f).

A second aspect of the invention relates to a polynucleotide encoding a polypeptide of
interest, wherein said polynucleotide is isolated by [...]
[...] aspect relates to a polypeptide of interest which is encoded by a polynucleotide as
defined in the second aspect.

[...] sequence of the present invention operably linked to one or more control
sequences [...] expression of the coding sequence in a suitable host cell under conditions
[...] will be understood to include any step
involved in the production of the polypeptide including, but not limited to, transcription,
post-transcriptional modification, translation, post-translational modification, and secretion.

"Expression construct", "expression vector", or "expression system" are used
[...] herein and are defined as a nucleic acid molecule, either single- or
double-stranded, which is isolated from a naturally occurring gene or which has been modified to
contain segments of nucleic acid combined and juxtaposed in a manner that would not
[...] in nature. The term nucleic acid construct is [...]
expression cassette when the nucleic acid construct contains all the control
sequences required for expression of a coding sequence of the present invention. The term
"coding sequence" is defined herein as a nucleic acid sequence which directly specifies the
amino acid sequence of its protein product. The boundaries of the coding sequence are
generally [...] located just upstream of the open reading frame at the 5' end of the mRNA
and a [...] sequence located just downstream of the open reading frame at the
3' end of the mRNA. A coding sequence can include, but is not limited to, DNA, cDNA, and
recombinant nucleic acid sequences.

An isolated nucleic acid sequence encoding a polypeptide of the present invention may be
manipulated in a variety of ways to provide for expression of the polypeptide. Manipulation
of the nucleic acid sequence prior to its insertion into a vector may be desirable or necessary
depending on the expression vector. The techniques for modifying nucleic acid sequences
utilizing recombinant DNA methods are well known in the art.

The term "control sequences" is defined herein to include all components which are
necessary or advantageous for the expression of a polypeptide of the present invention.
Each control sequence may be native or foreign to the nucleic acid sequence encoding the
polypeptide. Such control sequences include, but are not limited to, a leader
sequence, propeptide sequence, promoter, signal peptide sequence, and
transcription terminator. At a minimum, the control sequences include a promoter, and
transcriptional and translational stop signals. The control sequences may be [...]

[...] "operably linked" is defined herein as a configuration in which a control sequence is [...]
that the control sequence directs the expression of a polypeptide.

The control sequence may be an appropriate promoter sequence, a nucleic acid sequence
which is recognized by a host cell for expression of the nucleic acid sequence. The [...]
of the polypeptide. The promoter may be any nucleic acid sequence which [...]
polypeptides either homologous or heterologous to the host cell.

Examples of suitable promoters for directing the transcription of the nucleic acid constructs
of the present invention, especially in a bacterial host cell, are the promoters obtained from
the E. coli lac operon, Streptomyces coelicolor agarase gene (dagA), Bacillus subtilis
levansucrase gene (sacB), Bacillus licheniformis alpha-amylase gene (amyL), [...]
maltogenic amylase gene [...], [...] amylase gene [...], [...]
and xylB genes, and prokaryotic beta-lactamase gene (Villa-Kamaroff et al., 1978,
Proceedings of the National Academy of Sciences USA 75: 3727-3731), as well as the tac
promoter (DeBoer et al., 1983, [...]
21-25). Further promoters are described in "Useful proteins from recombinant bacteria" in
Scientific American, 1980, 242: 74-94; and in Sambrook, J. et al., 1989, Molecular Cloning,
A Laboratory Manual, 2d edition, Cold Spring Harbor, New York.

[...] for directing the transcription of the nucleic acid constructs
of the present invention in a filamentous fungal host cell are promoters obtained from the
genes for Aspergillus oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase,
Aspergillus niger neutral alpha-amylase, Aspergillus niger acid stable alpha-amylase,
Aspergillus niger or Aspergillus awamori glucoamylase (glaA), Rhizomucor miehei lipase,
Aspergillus oryzae alkaline protease, Aspergillus oryzae triose phosphate isomerase,
Aspergillus nidulans acetamidase, and Fusarium oxysporum trypsin-like protease (WO
96/00787), as well as the NA2-tpi promoter (a hybrid of the promoters from the genes for
Aspergillus niger neutral alpha-amylase and Aspergillus oryzae triose phosphate isomerase),
and mutant, truncated, and hybrid promoters thereof.

In a yeast host, useful promoters are obtained from the genes for Saccharomyces cerevisiae
enolase (ENO-1), Saccharomyces cerevisiae galactokinase (GAL1), Saccharomyces
cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase
(ADH2/GAP), and Saccharomyces cerevisiae 3-phosphoglycerate kinase. Other useful
promoters for yeast host cells are described by Romanos et al., 1992, Yeast 8: 423-488.

The control sequence may also be a suitable transcription terminator sequence, a sequence
recognized by a host cell to terminate transcription. The terminator sequence is operably
linked to the 3' terminus of the nucleic acid sequence encoding the polypeptide. Any
terminator which is functional in the host cell of choice may be used in the present invention.

Preferred terminators for filamentous fungal host cells are obtained from the genes for
Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans
anthranilate synthase, Aspergillus niger alpha-glucosidase, and Fusarium oxysporum
trypsin-like protease.

Preferred terminators for yeast host cells are obtained from the genes for Saccharomyces
cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), and Saccharomyces
cerevisiae glyceraldehyde-3-phosphate dehydrogenase. Other useful terminators for yeast
host cells are described by Romanos et al., 1992, supra.

The control sequence may also be a suitable leader sequence, a nontranslated region of an
mRNA which is important for translation by the host cell. The leader sequence is operably
linked to the 5' terminus of the nucleic acid sequence encoding the polypeptide. Any leader
sequence that is functional in the host cell of choice may be used in the present invention.

Preferred leaders for filamentous fungal host cells are obtained from the genes for
Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase.

Suitable leaders for yeast host cells are obtained from the genes for Saccharomyces
cerevisiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase,
Saccharomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol
dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP).

The control sequence may also be a polyadenylation sequence, a sequence operably linked
to the 3' terminus of the nucleic acid sequence and which, when transcribed, is recognized
by the host cell as a signal to add polyadenosine residues to transcribed mRNA. Any
polyadenylation sequence which is functional in the host cell of choice may be used in the
present invention.

Preferred polyadenylation sequences for filamentous fungal host cells are obtained from the
genes for Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus
nidulans anthranilate synthase, Fusarium oxysporum trypsin-like protease, and Aspergillus
niger alpha-glucosidase.

Useful polyadenylation sequences for yeast host cells are described by Guo and Sherman,
1995, Molecular Cellular Biology 15: 5983-5990.

It may also be desirable to add regulatory sequences which allow the regulation of the
expression of the polypeptide relative to the growth of the host cell. Examples of regulatory
systems are those which cause the expression of the gene to be turned on or off in response
to a chemical or physical stimulus, including the presence of a regulatory compound.
Regulatory systems in prokaryotic systems include the lac, tac, and trp operator systems. In
yeast, the ADH2 system or GAL1 system may be used. In filamentous fungi, the TAKA
alpha-amylase promoter, Aspergillus niger glucoamylase promoter, and Aspergillus oryzae
glucoamylase promoter may be used as regulatory sequences. Other examples of
regulatory sequences are those which allow for gene amplification. In eukaryotic systems,
these include the dihydrofolate reductase gene, which is amplified in the presence of
methotrexate, and the metallothionein genes, which are amplified with heavy metals. In
these cases, the nucleic acid sequence encoding the polypeptide would be operably linked
with the regulatory sequence.

The present invention also relates to nucleic acid constructs for altering the expression of an
endogenous gene encoding a polypeptide of the present invention. The constructs may
contain the minimal number of components necessary for altering expression of the
endogenous gene. In one embodiment, the nucleic acid constructs preferably contain (a) a
targeting sequence, (b) a regulatory sequence, (c) an exon, and (d) a splice-donor site.
Upon introduction of the nucleic acid construct into a cell, the construct inserts by
homologous recombination into the cellular genome at the endogenous gene site. The
targeting sequence directs the integration of elements (a)-(d) into the endogenous gene
such that elements (b)-(d) are operably linked to the endogenous gene. In another
embodiment, the nucleic acid constructs contain (a) a targeting sequence, (b) a regulatory
sequence, (c) an exon, (d) a splice-donor site, (e) an intron, and (f) a splice-acceptor site,
wherein the targeting sequence directs the integration of elements (a)-(f) such that elements
(b)-(f) are operably linked to the endogenous gene. However, the constructs may contain
additional components such as a selectable marker.

The introduction of these components results in production of a new transcription unit in
which expression of the endogenous gene is altered. In essence, the new transcription unit
is a fusion product of the sequences introduced by the targeting constructs and the
endogenous gene. In one embodiment in which the endogenous gene is altered, the gene is
activated. In this embodiment, homologous recombination is used to replace, disrupt, or
disable the regulatory region normally associated with the endogenous gene of a parent cell
through the insertion of a regulatory sequence which causes the gene to be expressed at
higher levels than evident in the corresponding parent cell.

The constructs further contain one or more exons of the endogenous gene. An exon is
defined as a DNA sequence which is copied into RNA and is present in a mature mRNA
molecule such that the exon sequence is in-frame with the coding region of the endogenous
gene. The exons can, optionally, contain DNA which encodes one or more amino acids
and/or partially encodes an amino acid. Alternatively, the exon contains DNA which
corresponds to a 5' non-encoding region. Where the exogenous exon or exons encode one
or more amino acids and/or a portion of an amino acid, the nucleic acid construct is
designed such that, upon transcription and splicing, the reading frame is in-frame with the
coding region of the endogenous gene so that the appropriate reading frame of the portion
of the mRNA derived from the second exon is unchanged.
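
The reading-frame requirement described above reduces to simple modulo-3 arithmetic. The following is a minimal sketch only; the function name and both parameters are hypothetical illustrations and are not part of the described constructs.

```python
def keeps_reading_frame(exon1_coding_nt: int, exon2_leading_nt: int) -> bool:
    """Return True if splicing the first exon onto the second exon
    leaves the second exon in its original reading frame.

    exon1_coding_nt:  number of coding nucleotides contributed by the
                      exogenous (first) exon, counted from the start codon
                      (hypothetical parameter).
    exon2_leading_nt: number of nucleotides at the 5' end of the second
                      exon that complete a codon begun in the first exon
                      (0, 1 or 2; hypothetical parameter).
    """
    if exon1_coding_nt < 0:
        raise ValueError("exon1_coding_nt must be non-negative")
    if exon2_leading_nt not in (0, 1, 2):
        raise ValueError("exon2_leading_nt must be 0, 1 or 2")
    # The junction is in frame when the nucleotides on both sides of the
    # splice site together make up a whole number of codons.
    return (exon1_coding_nt + exon2_leading_nt) % 3 == 0
```

For instance, an exogenous exon contributing 31 coding nucleotides would, under these assumptions, be compatible only with a second exon whose first two nucleotides complete the split codon.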

The splice-donor site of the constructs directs the splicing of one exon to another exon.
Typically, the first exon lies 5' of the second exon, and the splice-donor site overlapping and
flanking the first exon on its 3' side recognizes a splice-acceptor site flanking the second
exon on the 5' side of the second exon. A splice-acceptor site, like a splice-donor site, is a
sequence which directs the splicing of one exon to another exon. Acting in conjunction with
a splice-donor site, the splicing apparatus uses a splice-acceptor site to effect the removal of
an intron.

A fourth aspect relates to an expression system comprising a polynucleotide as defined in
the second aspect. A fifth aspect relates to a host cell comprising at least one copy of a
polynucleotide as defined in the second aspect, or an expression system as defined in the
fourth aspect.

Many ways have been described in the art to construct host cells comprising several stable
copies of a polynucleotide of interest, both as independently replicating extrachromosomal
entities and as chromosomally stably integrated copies. A preferred embodiment of the fifth
aspect relates to the host cell, wherein at least two copies of the polynucleotide as defined in
the second aspect are chromosomally integrated.

Process of Production 

The present invention also relates to processes for producing a polypeptide of the present
invention comprising (a) cultivating a strain, which in its wild-type form is capable of
producing the polypeptide, to produce a supernatant comprising the polypeptide; and (b)
recovering the polypeptide.

The present invention further relates to methods for producing a polypeptide of the present
invention comprising (a) cultivating a homologously recombinant cell, having incorporated
therein a new transcription unit comprising a regulatory sequence, an exon, and/or a splice
donor site operably linked to a second exon of an endogenous nucleic acid sequence
encoding the polypeptide, under conditions conducive for production of the polypeptide; and
(b) recovering the polypeptide. The methods are based on the use of gene activation
technology, for example, as described in U.S. Patent No. 5,641,670.

In the production methods of the present invention, the cells are cultivated in a nutrient
medium suitable for production of the polypeptide using methods known in the art. For
example, the cell may be cultivated by shake flask cultivation, small-scale or large-scale
fermentation (including continuous, batch, fed-batch, or solid state fermentations) in
laboratory or industrial fermentors performed in a suitable medium and under conditions
allowing the polypeptide to be expressed and/or isolated. The cultivation takes place in a
suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using
procedures known in the art. Suitable media are available from commercial suppliers or may
be prepared according to published compositions (e.g., in catalogues of the American Type
Culture Collection). If the polypeptide is secreted into the nutrient medium, the polypeptide
can be recovered directly from the medium. If the polypeptide is not secreted, it can be
recovered from cell lysates.

The polypeptides may be detected using methods known in the art that are specific for the
polypeptides. These detection methods may include use of specific antibodies, formation of
an enzyme product, or disappearance of an enzyme substrate. For example, an enzyme
assay may be used to determine the activity of the polypeptide as described herein.

The resulting polypeptide may be recovered by methods known in the art. For example, the
polypeptide may be recovered from the nutrient medium by conventional procedures
including, but not limited to, centrifugation, filtration, extraction, spray-drying, evaporation, or
precipitation.

The polypeptides of the present invention may be purified by a variety of procedures known
in the art including, but not limited to, chromatography (e.g., ion exchange, affinity,
hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g.,
preparative isoelectric focusing), differential solubility (e.g., ammonium sulfate precipitation),
SDS-PAGE, or extraction (see, e.g., Protein Purification, J.C. Janson and Lars Ryden,
editors, VCH Publishers, New York, 1989).

A final aspect relates to a process for producing a polypeptide of interest, comprising
cultivating a host cell as defined in the previous aspects under conditions suitable for
expressing the polynucleotide of the third aspect, wherein said host cell secretes the
polypeptide encoded by said polynucleotide into the growth medium. A preferred
embodiment relates to the process of the previous aspect, wherein the polypeptide is an
enzyme or a polypeptide having anti-microbial activity. Another preferred embodiment
relates to the process of the previous aspect, where an additional step of purifying the
polypeptide is performed.



The vector used herein is denoted pMHas5, and the nucleotide sequence is shown in SEQ
ID NO:1. The vector has the following features:

Feature        Location     Description
CDS            365-1156     Kanamycin resistance
CDS            2232-2387    Beta galactosidase alpha peptide
-10 signal     2189-2192    Shine Dalgarno
Promoter       2101-2189    Lac promoter
misc feature   626-650      KanP1 primer for BACE system
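
The feature coordinates listed above can be checked for internal consistency with a few lines of code. The sketch below is illustrative only: the `Feature` type and the helper `contains` are hypothetical names introduced here, the coordinates are copied from the feature list, and the plasmid length of 2403 is taken from the <211> field of SEQ ID NO:1.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    kind: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive
    description: str

    @property
    def length(self) -> int:
        return self.end - self.start + 1


PMHAS5_LENGTH = 2403  # <211> of SEQ ID NO:1

FEATURES = [
    Feature("CDS", 365, 1156, "Kanamycin resistance"),
    Feature("CDS", 2232, 2387, "Beta galactosidase alpha peptide"),
    Feature("-10 signal", 2189, 2192, "Shine Dalgarno"),
    Feature("Promoter", 2101, 2189, "Lac promoter"),
    Feature("misc feature", 626, 650, "KanP1 primer for BACE system"),
]


def contains(outer: Feature, inner: Feature) -> bool:
    """True if `inner` lies entirely within `outer`."""
    return outer.start <= inner.start and inner.end <= outer.end


if __name__ == "__main__":
    for f in FEATURES:
        in_range = 1 <= f.start <= f.end <= PMHAS5_LENGTH
        print(f"{f.kind:13s}{f.start:>5d}-{f.end:<5d} {f.length:>4d} nt  in range: {in_range}")
    kan, kanp1 = FEATURES[0], FEATURES[4]
    print("KanP1 primer inside kanamycin CDS:", contains(kan, kanp1))
```

Both coding sequences have lengths divisible by three (792 and 156 nucleotides), and the KanP1 primer site falls inside the kanamycin resistance coding sequence, as expected from the listed coordinates.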



Surprisingly, rolling circle amplification worked on plasmid DNA isolated from E. coli, but not
on the ligation mixture that was used to transform the E. coli. We determined in a series of
experiments that the reason why the ligated DNA was not susceptible to rolling circle
amplification was that pyrimidine dimers were present in the ligated DNA. The pyrimidine
dimers were formed by exposure of the fragment and plasmid DNA to ultraviolet light
before ligation.

A plasmid library was diluted to 1 ng/µl DNA in TE buffer (pH 8.0). 10 µl of the diluted plasmid
pool was spotted onto a UV source (Eagle Eye™ II UV transilluminator, Stratagene, USA),
which was adjusted to preparative mode (UV 360 nm). 2 µl samples were taken every 30
seconds from the irradiated DNA pool, starting with time=0 immediately before turning the
UV light on. 10 µl Amersham TempliPhi™ denaturing buffer was then added to each sample
and these were then heated to 95°C for 3 minutes. 10 µl of TempliPhi™ enzyme mix was
then added to each sample and the reaction was incubated at 30°C for 8 hours. The
samples were subsequently denatured at 95°C for 5 minutes, and then stored at 4°C.

Agarose gel loading buffer was added to the samples, which were then analyzed by 1%
agarose gel electrophoresis according to standard methods. The results showed that the
amount of amplification product was significantly reduced in the samples taken after only 30
seconds of UV treatment, decreasing until no amplification product was detectable in the
sample taken after two minutes of UV irradiation. Presumably modifications of the template
DNA had taken place, either by nicking, depurination or pyrimidine dimer formation. The
result demonstrated that it is essential to avoid UV treatment of templates to be used in
rolling circle amplification.
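
The sampling scheme of the UV time course can be written out as a short calculation. This is a sketch under one stated assumption: that the 10 µl pool is sampled until it is used up, which the text does not say explicitly but which is consistent with the last sample being taken after two minutes. All function and constant names are hypothetical.

```python
POOL_UL = 10.0             # diluted plasmid pool spotted on the UV source
SAMPLE_UL = 2.0            # volume withdrawn per time point
INTERVAL_S = 30            # seconds between samples, first sample at time=0
DENATURING_BUFFER_UL = 10.0
ENZYME_MIX_UL = 10.0


def sampling_times(pool_ul: float = POOL_UL,
                   sample_ul: float = SAMPLE_UL,
                   interval_s: int = INTERVAL_S) -> list:
    """UV exposure time (s) of each sample, assuming the pool is sampled
    until no material is left (an assumption, see the lead-in)."""
    n_samples = int(pool_ul // sample_ul)
    return [i * interval_s for i in range(n_samples)]


def reaction_volume_ul() -> float:
    """Final volume of each amplification reaction."""
    return SAMPLE_UL + DENATURING_BUFFER_UL + ENZYME_MIX_UL


if __name__ == "__main__":
    print("UV exposure per sample (s):", sampling_times())
    print("Reaction volume per sample (µl):", reaction_volume_ul())
```

Under this assumption the pool yields five samples, the last of which has received two minutes of UV exposure.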



Library preparation 

5 µg of plasmid pMHas5 was restricted with EcoRI and NotI according to the manufacturer's
instructions (New England Biolabs, USA). The restricted plasmid was loaded in a standard
1% TBE agarose gel and electrophoresed in order to separate the stuffer fragment from the
vector. Half of the gel, containing the DNA marker (1 kb ladder, BRL, USA), and the region
corresponding to approximately 1 mm of the 10 mm total width of the sample lane, was
separated using a ruler as a guide. The separated part of the gel was stained with ethidium
bromide and visualized on a UV light source. The band corresponding to the cut vector (ca.
2.6 kb) was marked by making a notch in the gel with a scalpel blade. The two gel parts
were aligned and the non-UV-treated portion corresponding to the cut plasmid was isolated.
A standard GFX™ purification (AP Biotech) was used to remove agarose, restriction enzyme
and contaminants. The restricted plasmid was eluted in 50 µl of 10 mM Tris buffer, pH 8.0, and
stored at -20°C until further use.

A cDNA library was prepared from Rhizomucor pusillus induced for 5 days on Mex-1 media,
as shown in the examples of WO 98/38288 and in the examples of international patent
application WO 01/12794. The protocols of these PCT publications were used as described,
except that care was taken to eliminate all steps involving UV treatment of the RNA and
DNA during the standard procedures. No size fractionation was performed on the RNA, first
strand or second strand cDNA. Ligation of the cDNA into pMHas5 was performed according
to WO 01/12794. Briefly, 2 µl of vector corresponding to 40 ng pMHas5 EcoRI-NotI
restricted plasmid was used with 6 µl corresponding to about 100 ng of non-size-fractionated
double-stranded cDNA.



pMHas5                             2 µl
cDNA                               6 µl
10X lig. buffer                    1 µl
T4 DNA ligase (3 U/µl)             1 µl
                                  ------
Total                             10 µl

The ligation was incubated at 16°C overnight and then heat-treated at 65°C for 20 minutes.
10 µl of dH2O was added to the ligation, which was then stored at -20°C until further use.

Rolling circle amplification

1 µl of ligated cDNA library corresponding to about 10 ng total DNA was added to 20 µl
TempliPhi™ denaturing buffer (Amersham Biotech). The sample was then heated to 95°C
for 3 minutes and then placed on ice. 10 µl of TempliPhi™ premix was added to the sample,
which was incubated for 8 hours at 30°C. The sample was then heated to 95°C for 5
minutes, cooled at 4°C and stored at -20°C until further use.

3 µl sample material was restricted with NotI in a total volume of 10 µl under standard
conditions (New England Biolabs). 1 µl of the restriction digest mix was visualized on a 1%
TBE agarose gel. The gel indicated that amplification had occurred. As expected, a diffuse
band between 2.8 and 3.5 kb was observed with tailing into higher molecular weights. This
indicated that plasmids with different size inserts were successfully amplified by the method,
and that cutting with the unique rarely cutting enzyme NotI created linear monomer plasmids
containing the inserts.

The remaining 9 µl of the restriction digest mix was purified on a GFX™ column to remove
buffer and enzyme. The sample was eluted from the column with 50 µl buffer (10 mM Tris,
pH 8.0). 5 µl of the purified sample was analyzed by agarose gel electrophoresis to verify that
the amplification product was still present. 20 µl of the purified sample was then used in a
standard ligation:

GFX purified digest of material   20 µl
10X lig. buffer                   10 µl
H2O                               69 µl
T4 DNA ligase (3 U/µl)             1 µl
                                  ------
Total                            100 µl

The ligated plasmid is expected to contain mainly open circular plasmid monomers with
insert. 930 pg of the ligation was used to transform E. coli DH10B electrocompetent cells. 10
pg of diluted supercoiled pUC19 monomer DNA was used as a control. The following results
were obtained:

Transformation    Transformants/µg DNA

ligation          5.[illegible] x 10^[illegible]
pUC19             8.6 x 10^[illegible]

This result clearly demonstrates that a library can be made by in vitro rolling circle
amplification. The lower transformation frequency of the ligation results primarily from the
fact that the ligation is relaxed plasmid DNA, and not supercoiled as is the pUC19 control. In
the literature, the drop in transformation efficiency is reported to be between 100- and 1000-
fold, in keeping with what we observed.

Transposon assisted signal trapping and supercoiling

Transposition with a SigA2 transposon was carried out as described in detail in WO
01/77315, with the following modifications: A period of 2 hours at 30°C was used for the
transposition reaction. Then the sample was split into two portions. The first was treated with
a DNA topoisomerase (DNA gyrase) according to the manufacturer's instructions (John Innes
Enterprises, England). The second half was left untreated.

It was expected that by supercoiling the plasmid through treatment with a DNA gyrase, one
can increase the efficiency of transformation into E. coli by 100-fold. This expectation was
confirmed.

Using RecA recombinase to create monomers

An efficient way to create monomers from a rolling circle amplified library is to treat the
concatameric DNA with a recombinase enzyme, such as the commercially available RecA
enzyme from E. coli. The result of such a treatment is the looping out of plasmid monomers
from the very long linear amplification products of rolling circle amplification. The method
has the advantage that it is not dependent on the use of restriction enzymes that may
inadvertently cut the inserted library DNA. The recombinase step may be performed either
simultaneously with the rolling circle amplification, after the amplification, or after transposon
treatment.
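
The limitation of the restriction enzyme route mentioned above can be illustrated in silico. The sketch below digests a toy concatamer at NotI sites (5'-GC^GGCCGC-3') and checks whether an insert carries an internal site. The sequences are made-up toy strings, not library sequences, and all function names are hypothetical.

```python
NOTI_SITE = "GCGGCCGC"   # NotI recognition sequence
NOTI_CUT_OFFSET = 2      # top strand is cut after GC: GC^GGCCGC


def find_sites(seq: str, site: str = NOTI_SITE) -> list:
    """0-based start positions of every occurrence of `site` in `seq`."""
    seq = seq.upper()
    positions = []
    i = seq.find(site)
    while i != -1:
        positions.append(i)
        i = seq.find(site, i + 1)
    return positions


def digest_linear(seq: str, site: str = NOTI_SITE,
                  cut_offset: int = NOTI_CUT_OFFSET) -> list:
    """Top-strand fragments of a linear molecule cut at every site."""
    cuts = [p + cut_offset for p in find_sites(seq, site)]
    bounds = [0] + cuts + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:]) if b > a]


def insert_would_be_cut(insert: str, site: str = NOTI_SITE) -> bool:
    """True if the insert itself contains the site used for monomerization."""
    return bool(find_sites(insert, site))


if __name__ == "__main__":
    monomer = "AAAA" + NOTI_SITE + "TTTTCCCC"   # toy 20-nt "plasmid"
    concatamer = monomer * 4                    # toy rolling circle product
    print([len(f) for f in digest_linear(concatamer)])
    print(insert_would_be_cut("ATGGCGGCCGCAT"))
```

In the toy digest, every internal fragment has the length of one monomer, while the two end fragments are partial. An insert that contains the site would be split by the same digest, which is the situation the recombinase route avoids.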



Reduction of amplification bias by use of vector specific primer combinations

Although this has not been observed to any large degree in our experience, selective
amplification bias may occur based on differences in the availability of priming sites for the
random oligomers used in the Amersham TempliPhi™ premix. In order to reduce this
potential bias, several primers specific to the cloning vector may be used. As described in
Dean et al. (2001, Rapid Amplification of plasmid and phage DNA using Phi29 DNA
polymerase and multiply-primed rolling circle amplification. Genome Research 11:1095-
1099), many primers, priming on both DNA strands, are preferable to one or a few primers. In
a preferred embodiment, a mixture of 10-20 or more primers is used with an even
distribution on each strand of the cloning vector.
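
One simple way to obtain such an even distribution is to space the priming sites at regular intervals around the circular vector and alternate the strand. The sketch below only proposes candidate positions; it does not design primer sequences, and the function name, the alternating-strand scheme and the choice of 12 primers are assumptions made for illustration. The vector length of 2403 is that of pMHas5 (SEQ ID NO:1).

```python
def evenly_spaced_primer_sites(vector_len: int, n_primers: int) -> list:
    """Return (position, strand) pairs spread evenly around a circular vector.

    Positions are 1-based. Strands alternate between '+' and '-', so an
    even n_primers gives the same number of sites on each strand.
    """
    if n_primers < 2:
        raise ValueError("at least two primers are needed to cover both strands")
    if vector_len < n_primers:
        raise ValueError("vector is shorter than the number of primers")
    step = vector_len / n_primers
    return [(int(i * step) + 1, "+" if i % 2 == 0 else "-")
            for i in range(n_primers)]


if __name__ == "__main__":
    for position, strand in evenly_spaced_primer_sites(2403, 12):
        print(f"{strand} strand, position {position}")
```

In practice the candidate positions would still have to be adjusted for primer quality, and sites in the cloning region would be avoided; those steps are outside this sketch.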



CLAIMS

1. A method for isolating a polynucleotide that encodes a polypeptide of interest which
comprises a signal sequence for secretion or partial secretion, the method comprising the
sequential steps of:

a) providing a DNA or cDNA library from an organism producing the polypeptide of
interest, wherein the library is comprised in a circular vector and is produced in vitro
without ultraviolet irradiation of the component polynucleotides;

b) amplifying the library by rolling circle amplification, thereby forming concatamers;

c) inserting into the library a DNA fragment comprising a promoterless and secretion
signal-less polynucleotide encoding a secretion reporter;

d) introducing the amplified library comprising the inserted DNA fragment into a host cell;

e) screening for and selecting a host cell that secretes or partially secretes the active
secretion reporter; and

f) identifying from the selected host cell the polynucleotide into which the secretion
reporter was inserted, and isolating the polynucleotide;

wherein steps b) and c) may be performed in any order.

2. The method of claim 1, wherein the DNA or cDNA library is normalized.

3. The method of claim 1 or 2, wherein the genomic DNA library or cDNA library is derived
from a microorganism.

4. The method of claim 3, wherein the microorganism is a fungus, a filamentous fungus or a
yeast.

5. The method of claim 3, wherein the microorganism is a bacterium.

6. The method of claim 3, wherein the microorganism is an archaeon.

7. The method of claim 1 or 2, wherein the genomic DNA library or cDNA library is derived
from a multicellular organism, preferably from a mammalian cell, most preferably from a
human cell.

8. The method of any of claims 1 - 7, wherein the vector comprises at least one restriction
enzyme cleavage site and/or at least one cos site and/or at least one recombination
recognition site.

9. The method of any of claims 1 - 8, wherein step c) is performed in vitro.

10. The method of any of claims 1 - 9, wherein the DNA fragment of claim 1 comprises a 
transposon, preferably a MuA transposon. 

11. The method of any of claims 1 - 10, wherein the DNA fragment comprises an origin of
replication which is functional in the host cell, preferably the origin of replication is functional
in Escherichia coli, more preferably the origin of replication is a derivative of colE1, oriV,
P15A, or colDF13, and most preferably the origin of replication is colE1.

12. The method of any of claims 1 - 11, wherein the secretion reporter is a protein which,
when secreted from the host cell, allows said cell to grow in the presence of a substance
which otherwise inhibits growth of said cell.

13. The method of claim 12, wherein the secretion reporter is a β-lactamase or an invertase.

14. The method of any of claims 1 - 13, wherein the polynucleotide of the DNA fragment of
step (b) encodes a secretion reporter carrying an N-terminal peptide linker which comprises
a specific target site for proteolytic cleavage.



15. The method of any of claims 1 - 14, wherein the amplified library concatamers are
converted to monomers before performing step d).

16. The method of claim 15, wherein the vector comprises at least one restriction enzyme
recognition site, and the concatamers are converted to monomers by restriction enzyme
digestion and then circularized by ligation.

17. The method of claim 15, wherein the vector comprises at least one recombination
recognition site, and the concatamers are converted to monomers by circularization through
homologous recombination, mediated by the recognition sites and a specific recombinase.

18. The method of any of claims 15 - 17, wherein the monomers are circularized and then
treated with a DNA topoisomerase.

19. The method of any of claims 1 - 14, wherein the vector comprises at least one cos site,
and wherein subsequent to steps b) and c) the amplified library concatamers are converted
to monomers prior to step d) by cos site cleavage during phage-packaging.

20. The method of any of claims 1 - 14, wherein the library is introduced into the host cell as
concatamers.

21. The method of any of claims 1 - 20, wherein the host cell is bacterial.

22. The method of claim 21, wherein the bacterial host cell is an Escherichia, Lactococcus,
Streptomyces, Enterococcus or Bacillus cell, preferably of the species Escherichia coli,
Lactococcus lactis, Streptomyces griseus, Streptomyces coelicolor, Enterococcus faecalis,
Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus
clausii, Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus
megaterium, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis.

23. The method of any of claims 1 - 20, wherein the host cell is fungal.

24. The method of claim 23, wherein the fungal host cell is of the genus Candida,
Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, Yarrowia, Acremonium,
Aspergillus, Aureobasidium, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe,
Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Piromyces,
Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, or Trichoderma.

25. The method of claim 23, wherein the fungal host cell is of the species Saccharomyces
cerevisiae, Aspergillus aculeatus, Aspergillus awamori, Aspergillus nidulans, Aspergillus
niger, or Aspergillus oryzae.

26. The method of any of claims 1 - 20, wherein the host cell is mammalian.

27. The method of any of claims 1 - 26, wherein the identifying of the polynucleotide in step
f) is done by DNA sequencing using at least one primer directed to the DNA fragment of step
c), or using at least one primer directed to the vector of step a).

28. The method of claim 27, where isolating the polynucleotide in step f) is done by utilizing
the DNA sequence information obtained.

29. The method of any of claims 1 - 28, wherein the polynucleotide in step f) is isolated from
the genome of the organism producing the polypeptide of interest, or from a DNA or cDNA
library of the organism.

30. The method of any of claims 1 - 29, wherein the polypeptide of interest is an enzyme
that is secreted from the host cell.

31. The method of any of claims 1 - 29, wherein the polypeptide of interest is a membrane-
bound receptor, preferably a two-component signal (TCS) transduction receptor, and more
preferably a cytokine receptor.

32. The method of any of claims 1 - 29, wherein the polypeptide of interest is a secreted
cytokine.

33. The method of any of claims 1 - 29, wherein the polypeptide of interest is a polypeptide 
which elicits an immunogenic response in humans. 

34. The method of any of claims 1 - 29, wherein the polypeptide of interest has antimicrobial
activity.

35. The method of any of claims 1 - 29, wherein the polypeptide of interest is a plant
pathogenic polypeptide.

36. The method of any of claims 1 - 35, wherein an additional step of constructing an
expression system is performed, said expression system comprising the polynucleotide
isolated in step f).

37. A polynucleotide encoding a polypeptide of interest, wherein said polynucleotide is
isolated by the method of the present invention.

38. A polypeptide of interest which is encoded by a polynucleotide as defined in claim 37.

39. An expression system comprising a polynucleotide as defined in claim 37.

40. A host cell comprising at least one copy of a polynucleotide as defined in claim 37, or an
expression system as defined in claim 39.

41. The host cell of claim 40, wherein at least two copies of the polynucleotide as defined in
claim 37 are chromosomally integrated.

42. A process for producing a polypeptide of interest, comprising cultivating a host cell as
defined in claim 40 or 41 under conditions suitable for expressing the polynucleotide as
defined in claim 37, wherein said host cell secretes the polypeptide encoded by said
polynucleotide into the growth medium.

43. The process of claim 42, wherein the polypeptide is an enzyme or a polypeptide having
anti-microbial activity.

44. The process of claim 42 or 43, where an additional step of purifying the polypeptide is
performed.

TITLE: Methods for rolling circle amplification and signal trapping of libraries
ABSTRACT

A method for isolating a polynucleotide that encodes a polypeptide of interest which
comprises a signal sequence for secretion or partial secretion, utilizing rolling circle
amplification and signal trapping of libraries.

<110> Novozymes A/s 
Schnorr, Kirk

<120> Methods for rolling circle amplification and signal trappi
<130> 10292.000-DK
<160> 1

<170> PatentIn version 3.1

<210> 1
<211> 2403
<212> DNA

<213> Artificial sequence 
<220> 

<223> Plasmid pMHas5 
<400> 1 

cgataagcta gcttcacgct gccgcaagca ctcagggcgc aagggctgct aaaggaagcg 
gaacacgtag aaagccagtc cgcagaaacg gtgctgaccc cggatgaatg tcagctactg 
ggctatctgg acaagggaaa acgcaagcgc aaagagaaag caggtagctt gcagtgggct 
tacatggcga tagctagact gggcggtttt atggacagca agcgaaccgg aattgccagc 
tggggcgccc tctggtaagg ttgggaagcc ctgcaaagta aactggatgg ctttcttgcc 
gccaaggatc tgatggcgca ggggatcaag atctgatcaa gagacaggat gaggatcgtt 
tcgcatgatt gaacaagatg gattgcacgc aggttctccg gccgcttggg tggagaggct 
attcggctat gactgggcac aacagacaat cggctgctct gatgccgccg tgttccggct 
gtcagcgcag gggcgcccgg ttctttttgt caagaccgac ctgtccggtg ccctgaatga 
actccaagac gaggcagcgc ggctatcgtg gctggccacg acgggcgttc cttgcgcagc 
tgtgctcgac gttgtcactg aagcgggaag ggactggctg ctattgggcg aagtgccggg 
gcaggatctc ctgtcatctc accttgctcc tgccgagaaa gtatccatca tggctgatgc 
aatgcggcgg ctgcatacgc ttgatccggc tacctgccca ttcgaccacc aagcgaaaca 
tcgcatcgag cgagcacgta ctcggatgga agccggtctt gtcgatcagg atgatctgga 
cgaagagcat caggggctcg cgccagccga actgttcgcc aggctcaagg cgcggatgcc 
cgacggcgag gatctcgtcg tgacccatgg cgatgcctgc ttgccgaata tcatggtgga 
aaatggccgc ttttctggat tcatcgactg tggccggctg ggtgtggcgg accgctatca 
ggacatagcg ttggctaccc gtgatattgc tgaagagctt ggcggcgaat gggctgaccg 
cttcctcgtg ctttacggta tcgccgctcc cgattcgcag cgcatcgcct tctatcgcct 
tcttgacgag ttcttctgag cgggactctg gggttcgcga tgataagctg tcaaacatga 
gaattacaac ttatatcgta tggggctgac ttcaggtgct acatttgaag agataaattg 
cactgaaatc tagaaatatt ttatctgatt aataagatga tcttcttgag atcgttttgg 
tctgcgcgta atctcttgct ctgaaaacga aaaaaccgcc ttgcagggcg gtttttcgaa
ggttctctga gctaccaact ctttgaaccg aggtaactgg cttggaggag cgcagtcacc 1440
aaaacttgtc ctttcagttt agccttaacc ggcgcatgac ttcaagacta actcctctaa 1500 
atcaattacc agtggctgct gccagtggtg cttttgcatg tctttccggg ttggactcaa 1560 
gacgatagtt accggataag gcgcagcggt cggactgaac ggggggttcg tgcatacagt 
ccagcttgga gcgaactgcc tacccggaac tgagtgtcag gcgtggaatg agacaaacgc 
ggccataaca gcggaatgac accggtaaac cgaaaggcag gaacaggaga gcgcacgagg 
gagccgccag gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc caccactgat 
ttgagcgtca gatttcgtga tgcttgtcag gggggcggag cctatggaaa aacggctttg 
ccttctttcc tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag 
ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg 
aagagcgccc aatacgcaaa ccgcctctcc ccgcgcgttg gccgattcat taatgcagct 
ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt 
agctcactca ttaggcaccc caggctttac actttatgct tccggctcgt atgttgtgtg 
gaattgtgag cggataacaa tttcacacag gaattcacag ctatgctaga gcggccgctc 
gacctgcagg catgcaagct tggcactggc cgtcgtttta caacgtcgtg actgggaaaa 
ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca gctggcgtaa 2340 
tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga atggcgaatg 
gcg